Exploring Distributed Resource Allocation Techniques in the SLURM Job Management System

Authors

  • Xiaobing Zhou
  • Hao Chen
  • Ke Wang
  • Michael Lang
  • Ioan Raicu
Abstract

With the exponential growth of distributed computing systems in both FLOPS and cores, scientific applications are growing more diverse, with a variety of workloads. These workloads include traditional large-scale High Performance Computing MPI jobs and ensemble workloads, such as Many-Task Computing workloads composed of an extremely large number of finer-grained tasks, where tasks are defined at a per-core or per-node level and often execute in milliseconds to seconds. Delivering high throughput and low latency for these heterogeneous workloads requires a distributed job management system that is orders of magnitude more scalable and available than today's centralized batch-scheduled job management systems. In this paper, we present SLURM++, a distributed job launch prototype that extends the SLURM resource manager by integrating the ZHT zero-hop distributed key-value store for distributed state management. SLURM++ consists of multiple controllers, each managing several SLURM daemons, while ZHT stores all the job metadata and the SLURM daemons' state. We compared SLURM with our SLURM++ prototype using a variety of microbenchmarks with different job sizes (small, medium, and large) at modest scales (500 nodes), with excellent results (10X higher job throughput). Scalability trends show that performance is expected to be many orders of magnitude higher on tomorrow's extreme-scale systems.

Keywords: job management systems; job launch; distributed scheduling; key-value stores
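The architecture described in the abstract (several controllers, each owning a partition of compute daemons, with shared state kept in a key-value store) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: `KVStore` is a hypothetical in-memory stand-in and not the ZHT API, the `free:` and `job:` key names are invented, and the policy of claiming nodes from peer partitions when the local one is too small is one plausible design, not necessarily the method used in SLURM++.

```python
import threading


class KVStore:
    """Hypothetical in-memory stand-in for a distributed key-value store."""

    def __init__(self):
        self._data = {}
        self._lock = threading.Lock()

    def get(self, key):
        with self._lock:
            return self._data.get(key)

    def put(self, key, value):
        with self._lock:
            self._data[key] = value

    def compare_and_swap(self, key, expected, new):
        """Set key to new only if it still holds expected; report success."""
        with self._lock:
            if self._data.get(key) != expected:
                return False
            self._data[key] = new
            return True


class Controller:
    """One of several controllers; each owns a partition of compute nodes."""

    def __init__(self, name, nodes, store):
        self.name = name
        self.store = store
        # Publish this controller's free nodes so peers can see them.
        store.put(f"free:{name}", tuple(nodes))

    def _take(self, owner, count):
        """Atomically claim up to count free nodes from owner's partition."""
        key = f"free:{owner}"
        while True:
            free = self.store.get(key)
            taken, rest = free[:count], free[count:]
            if self.store.compare_and_swap(key, free, rest):
                return list(taken)

    def _give_back(self, owner, nodes):
        """Atomically return nodes to owner's free list."""
        key = f"free:{owner}"
        while True:
            free = self.store.get(key)
            if self.store.compare_and_swap(key, free, free + tuple(nodes)):
                return

    def launch(self, job_id, size, controller_names):
        """Allocate size nodes, local partition first, then peer partitions."""
        claimed = []  # (owner, nodes) pairs, kept so a failed launch can roll back
        needed = size
        peers = [n for n in controller_names if n != self.name]
        for owner in [self.name] + peers:
            if needed == 0:
                break
            nodes = self._take(owner, needed)
            if nodes:
                claimed.append((owner, nodes))
                needed -= len(nodes)
        if needed > 0:
            # Not enough free nodes anywhere: release everything claimed.
            for owner, nodes in claimed:
                self._give_back(owner, nodes)
            return None
        allocation = [n for _, nodes in claimed for n in nodes]
        self.store.put(f"job:{job_id}", tuple(allocation))  # job metadata
        return allocation
```

In this sketch no controller is a central point of coordination: any controller can launch a job by reading and updating the shared store, and the compare-and-swap retry loop keeps two controllers from claiming the same node.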

Related resources

SLURM: Simple Linux Utility for Resource Management

Simple Linux Utility for Resource Management (SLURM) is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for Linux clusters of thousands of nodes. Components include machine status, partition management, job management, scheduling, and stream copy modules. This paper presents an overview of the SLURM architecture and functionality. 1 Overview Simpl...

Custom accounting for the Simple Linux Utility for Resource Management

An efficient way to manage the workload on High Performance Computing clusters is to use resource managers. One such resource manager and job scheduling system for Linux clusters is the Simple Linux Utility for Resource Management (SLURM). At the National Energy Research Scientific Computing Center (NERSC), users regularly submit and run hundreds of jobs. Thus, it is necessary to have an acc...

Integer programming based heterogeneous CPU-GPU cluster schedulers for SLURM resource manager

We present two integer programming based heterogeneous CPU-GPU cluster schedulers, called IPSCHED and AUCSCHED, for the widely used SLURM resource manager. Our scheduler algorithms take windows of jobs and solve allocation problems in which free CPU cores and GPU cards are allocated collectively to jobs so as to maximize some objective functions. Our AUCSCHED scheduler employs an auction based ...

Towards Next Generation Resource Management at Extreme-Scales

With the exponential growth of distributed systems in both FLOPS and parallelism (number of cores/threads), scientific applications are growing more diverse with various workloads. These workloads include traditional large-scale high performance computing (HPC) MPI jobs, and HPC ensemble workloads that support the investigation of parameter sweeps using many small-scale coordinated jobs, as wel...

Evaluating Scalability and Efficiency of the Resource and Job Management System on Large HPC Clusters

The Resource and Job Management System (RJMS) is the middleware in charge of delivering computing power to applications in HPC systems. The increasing number of computational resources in modern supercomputers brings new levels of parallelism and complexity. To maximize the global throughput while ensuring good efficiency of applications, RJMS must deal with issues like manageability, scalabili...

Publication date: 2013